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Does Face Image Statistics Predict a Preferred Spatial Frequency for Human Face 

Processing? 

Matthias S. KeiE 

Basic Psychology Department, Faculty for Psychology, University of Barcelona (UB), 
Passeig de la Vail d'Hebron 171, E-08035 Barcelona (Spain) 

Psychophysical experiments suggested a relative importance of a narrow band of spatial frequen- 
cies for recognition of face identity in humans. There exists, however, no conclusive evidence of why 
it is that such frequencies are preferred. To address this question, I examined the amplitude spectra 
of a large number of face images, and observed that face spectra generally fall off steeper with 
spatial frequency compared to ordinary natural images. When external face features (like hair) are 
suppressed, then whitening of the corresponding mean amplitude spectra revealed higher response 
amplitudes at those spatial frequencies which are deemed important for processing face identity. 
The results presented here therefore provide support for that face processing characteristics match 
corresponding stimulus properties. 
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I. INTRODUCTION 



It has been suggested that the processing of sensory in- 
formation in the brain has adapted to the specific signal 
statistics of stimuh ([l]). Such stimulus-specific adapta- 
tion is tantamount to taking advantage of statistical reg- 
ularities in order to encode the highest possible amount 
of information about the signal ( 0, B [12, HI, [131 ) under 
various constraints. The constraints include, for exam- 
ple, minimizing energy expenditure ([3 [13, [HQ, min- 
imizing wiring costs between processing units ( 19|), or 
reducing spatial and temporal redundancies in the input 
signal (111, ii [13, [H). 

In the case of visual stimuli, natural images reveal a sta- 
tistical regularity that corresponds to an approximately 
linear decrease of their amplitude spectra as a function 
of spatial frequency when scaling both coordinate axis 
logarithmically ([3, [IH). This property is equivalent to 
strong pairwise correlations between pairs of luminance 
values (Ull). It has been proposed that visual neurons 
utilize this statistical property in a way that cells tuned 
to different spatial frequencies have equal sensitivities 
([HI)- Thus, neurons tuned to high spatial frequen- 
cies should increase their response gain such that they 
achieve the same response levels as low frequency neu- 
rons. This is the response equalization hypothesis (which 
should be distinguished from the decorrelation hypothe- 
sis) (P, [13, [II])- Response equalization ("whitening") 
may enhance the information throughput from one neu- 
ronal stage to another by adjusting the output of one 
stage such that it matches the limited dynamic range of 
the successive stage ([l3|). 

The present article unveils a link between statistical prop- 
erties of face images and psychophysical data for the pro- 
cessing of face identity. The processing of face identity 
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was found to preferably depend on a narrow spatial fre- 
quency band (about 2 octaves) from 8 to 16 cycles per 
face (fg, 10, 12, 16, 24, 25, 26, 30]). However, to the best 
of my knowledge, no explanation has been offered yet of 
why it is that face processing mechanisms in the human 
brain reveal such a preference. 

Here I analyzed the amplitude spectra of a large num- 
ber of face images. Different types of amplitude spectra 
were considered - with and without suppression of exter- 
nal face features (hair, shoulders, etc.). The spectra were 
whitened (i.e., "response" -equalized) according to three 
different procedures. In this way it is demonstrated that 
the main results are largely independent of the specific 
method that was used for whitening: amplitudes were 
higher at spatial frequencies around 10 cycles per face 
- but only in those spectra where external face features 
were suppressed. Therefore, the effect must have been 
produced by internal face features (eyes, mouth, nose). 



II. RESULTS 
Amplitude spectra 

Amplitude spectra are best conceived in polar coordi- 
nates, where the spatial frequency k varies proportional 
to radius. Thus, spectral amplitudes which have the same 
spatial frequency lie on a circle. The 2-D spectrum can 
be collapsed into an 1-D isotropic spectrum for each k by 
averaging all amplitudes on that circle. This means that 
in an isotropic spectrum any orientation dependence of 
the amplitudes is lost. 

The amplitude spectra of natural images were found to 
depend on spatial frequency as oc fc", with an average 
(isotropic) spectral slope a w — 1 ([3, [U). 
How do the amplitude spectra of face images compare to 
this finding? To answer, I computed slopes of the am- 
plitude spectra of 868 female and 868 male face images 
(size 256 x 256). In a double- logarithmic representation. 
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FIG. 1: Corrected Blackman-Harris spectrum (females), (a) Logarithmized, mean amplitude spectrum of all female 
face images. Prior to computing individual spectra, a Blackman-Harris (B.H.-) window was applied to each face image in order 
to suppress external face features (Supp. Fig. (7]^). The application of the B.H. -window, however, leaves an undesired spectral 
"fingerprint" in each of the spectra (Supp. Fig. [Sh), which was attenuated before averaging (Supp. Fig. |6]). 
(6) The 2-D spectrum shown on the left is transformed into a 1-D isotropic spectrum by averaging all amplitudes with different 
orientations at a fixed frequency k {— circle symbols). The size of each circle symbol is proportional to the standard deviation 
(s.d.). The maximum s.d. (biggest circle) was 9186.75 (39.3%), and the minimum s.d. (smallest circle) was 252.67 (28.08%). 
In the legend, e denotes slope values a (i.e., a = e). For comparison, the typical slope of natural images (e — —1) is also 
shown as a dashed gray line. The label '^vanilla" refers to line fitting with an ordinary linear regression (least square fit) 
algorithm for computing slopes. Since linear regression is sensitive to outliers, slope values were additionally computed with 
an outlier-insensitive (=robust) algorithm. Finally, the slope for the uncorrected ("raw") amplitude spectrum is also indicated. 



gender 


averaging of: 


raw 


cor r. raw 


B.H. 


corr.B.H. 


female 


slopes 


-1.608±0.0858 


-1.604±0.0870 


-1.686±0.0698 


-1.654±0.0731 




spectra 


-1.584 


-1.579 


-1.701 


-1.668 


male 


slopes 


-1.649±0.0738 


-1.645±0.0757 


-1.673±0.0785 


-1.642±0.0895 




spectra 


-1.644 


-1.637 


-1.689 


-1.658 



TABLE L For each gender, the table shows the average slope values for the four types of amplitude spectra. Two possibilities 
for computing these values were considered: "slopes" means that individual slope values were averaged (each gender n = 868, 
c.f. Supp. Fig. [9]), and "spectra" refers to the slope of the average spectrum as illustrated with Figurellja) . 



these spectra also decreased approximately linear as a 
function of spatial frequency (Figure [1]). Therefore a line 
with (spectral) slope a could be fitted to each spectrum. 
Four different types of amplitude spectra were consid- 
ered for each face image (with different a, see table Hand 
methods section). 

At first the spectra of the original images were computed 
("raw'). The second type of spectrum is defined by atten- 
uating in each spectrum the truncation artifacts ("corr. 
raw", Supp. Fig. [5lc and Supp. Fig. [6|). These arti- 
facts are a consequence of the cropped shoulder region 
being displayed in each image besides the actual face. To 
smoothly strip off external face features (hke the hair, 
i.e. anything but the actual face), a Blackman-Harris 
window was applied to each image prior to computing 
its spectrum ("Blackman-Harris" or "B.H." - see Supp. 
Fig. Off). Because application of the B.H. -window leaves 
a faint but characteristic spectral "fingerprint" (Supp. 
Fig. [Sja), a further spectrum type ("corr. B.H.") was 
considered, with the artificial "fingerprint" being atten- 
uated. 

The mean isotropic slope values were computed in two 



ways. First, the spectral slope of each face image was 
computed, and individual slope values were averaged (la- 
bel "slopes" in table Second, an average spectrum is 
computed at first, which is composed of all individual 
spectra (see Figure [1]). The second slope value corre- 
sponds then to the slope of the average spectrum (la- 
bel "spectra" in table Isotropic slope values are situ- 
ated around —1.6, with minima and maxima of —2.014 
& -1.180 (females), respectively, and -1.994 & -1.007 
(males). 

Notice that the standard deviations associated with the 
slopes of arbitrary natural images are usually bigger 
([3l|, [3^), as there is no restriction on displayed content 
and scale, respectively (^32]). 

Usually, a varies also as a function of orientation Q 
([2^, [3^). The orientation dependence is illustrated by 
means of the averaged corrected spectra (Figure^]). Min- 
imum slope values are located at 0° (wave vector pointing 
to east) and 90° (north), respectively, whereas maxima 
tend to be at oblique orientations. Slope values of the 
B.H. spectra vary more than with the raw spectra. As 
external features are widely suppressed in the B.H. spec- 



3 



corr.raw & corr.B.H. spectral slopes (robust fit) 




s.d. corr.B.H. (m+f) 
mean corr.B.H. (m) 
mean corr.B.H. (f) 
mean corr.B.H. (m+f) 
mean corr.raw (m) 
mean corr.raw (f) 
mean corr.raw (m+f) 
median corr.raw (m+f) 
median corr.B.H. (m+f) 
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FIG. 2; Oriented spectral slope. The curves juxtpose ori- 
ented spectral slopes from corrected raw ( "corr.raw" ) spectra 
and corrected Blackman-Harris- windowed ("corr.B.H.") spec- 
tra (see legend). Slopes were computed from the respective 
averaged spectra, with angular increments of 30° ([l^l)- Error 
bars denote ±1 standard deviation (estimated using robust 
statistics). Uncorrected spectra show similar dependencies of 
slopes from orientation. Notice that slope values are defined 
modulus 180°. 



tra, minimum slopes are associated with the orientations 
of the internal face features (0°,180°: nose; 90°, 270°: 
eyes, mouth, and the bottom termination of the nose). 
Summarizing so far, the majority of the individual a for 
face images is more negative than the theoretically pre- 
dicted lower bound of —1.5 for natural images (Q) (ta- 
ble U Supp. Fig. [H]). Similar observations also hold for 
spectral slopes of the mean amplitude spectra (Figure[T]). 
This should not come as a surprise since the structure of 
face images is different from natural images: face im- 
ages are not composed of self-occluding, constant inten- 
sity surface patches (0, [13)) and lack the self-similar 
distribution of spectral energy as it was reported for nat- 
ural images (|ll|). 



Whitening the Amplitude spectra 

Here I ask whether by amplitude equalization of am- 
plitude spectra ("whitening") one could explain psy- 
chophysical data on face perception. The results which 
are presented below were obtained with the mean spec- 
tra. 

Consider first the isotropic (1-D) spectra. Because the 
spectra fall, as a function of spatial frequency fc, as 
c>c fc~l"l, we can multiply amplitudes by /c'"' to obtain 
a "flat" spectrum (in the sense that its Shannon entropy 
is maximal). The slopes which were used to this end are 
the "spectra" ones from table [J Whitened 1-D spectra 
are shown in Figure [S] They are not completely flat, but 
instead have a global maximum at around 10 cycles per 
face, and a second but local maximum at around 30 cy- 
cles per face. 



FIG. 3: 1-D whitening. Whitening of the corrected mean, 
isotropic 1-D spectra reveals a global amplitude maximum 
at « 10 cycles per face with all four spectra (see legend). 
Symbol size is proportional to standard deviation (relative 
values are indicated in the figure). The slopes which were 
used for whitening are indicated in the legend (c.f. table Ull. 

Consider now the 2-D spectra, where whitening was car- 
ried out according to three different procedures: whiten- 
ing by slopes (analogous to the 1-D case), by variance, 
and by diffusion (see methods section). Results are 
shown in Figure 2] for females, and in Supp. Fig. [TO] 
for males. For both genders, the whitened B.H. -spectra 
reveal amplitude maxima only within a narrow band of 
low spatial frequencies. Furthermore, frequency maxima 
appear only at a specific orientation in the spectra which 
corresponds to horizontally oriented face features ("hor- 
izontal amplitudes", i.e. eyes and mouth). These results 
are obtained independently from the specific whitening 
procedure which was used (sZope-whitening: Figure UJa 
fc Figure llOb : variance-whitenmg: Supp. Fig. 111! diffu- 
sion- whitening: not shown). 

Plotting of only these "horizontal amplitudes" (indicated 
by a white box in Figure |31a) for all three whitening 
procedures allows to identify the spatial frequencies of 
the maxima with higher precision. The curves now show 
clearly that the maxima occur in the range from 10 to 
15 cycles per face height. Nevertheless, maxima are only 
revealed by whitening of the B.H. -windowed spectra, but 
not by whitening of any raw spectra. This means that 
amplitude enhancement due to internal face features is 
annihilated by the presence of external face features (such 
as hair or shoulder). 

III. DISCUSSION 

Here, I studied amplitude spectra of face images in the 
context of response equalization (whitening). Were exter- 
nal face features (hair, shoulder) suppressed by window- 
ing the face images with a Blackman-Harris window, then 
amplitude maxima are observed in the whitened spectra 
at low spatial frequencies. For the isotropic 1-D spec- 
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mrraGted B.H., iaotnopic by abpeifemales 




FIG. 4: 2-D whitening, (a) Slope- whitening of the mean corrected B.H.-spectra unveiled clear maxima at horizontal feature 
orientations (marked by a white box). Here the female data are shown (male data: Supp. Fig. llOp . (b) The curves show the 
amplitudes at the location demarcated by the white box in the spectrum: "log (spectrum)" are the logarithmized amplitudes 
without whitening; amplitudes whitened "by slope", "by variance", and "by diffusion" (see methods sections for further details 
on the three whitening procedures). The important result here is that whitened amplitudes reveal a distinct maxima irrespective 
of the specific whitening method at ~ 10 — 15 cycles per face height. The variance-whitened spectra are shown in Supp. Fig. 1111 



tra, maxima are situated around 10 cycles per face, and 
for the 2-D spectra at around 10 — 15 cycles per face 
height. In the 2-D case, three different whitening meth- 
ods yielded consistent results. 

Several psychophysical studies suggest that recognition 
of face identity works best in a narrow band (bandwidth 
about 2 octaves) of spatial fre quen cies from 8 to « 16 
cycles per face (i, [ll, [H El, lH HE HI). Notice that 
this does not mean that face recognition exclusively de- 
pends on this frequency band, as faces can still be recog- 
nized when corresponding frequency information is sup- 
pressed (diilll). 

Because the amplitude maxima appear in the whitened 
spectra exclusively at horizontal feature orientations, my 
results suggest that the psychophysical frequency prefer- 
ence might have been caused by an adaptation of corre- 
sponding neuronal mechanisms to eyes and mouth. 
Interestingly, in the earlier cited psychophysical studies 
the spatial frequencies are often measured in "cycles per 
face width" (i.e., along vertically oriented face features), 
whereas the results presented here were rather brought 
about by horizontally oriented face features. The factors 
to convert spatial frequencies from "cycles per image" to 
"cycles per face" (see methods) are statistically different 
for width and height (as suggested by a one-way ANOVA 
and a Kruskal-Wallis test). However, they are not so dif- 
ferent in absolute terms. The aforementioned frequency 
interval of 10 — 15 cycles per face height transforms into 
K, 9 — 13.5 cycles per face width for females and « 9—13.6 
cycles per face width for males, respectively, what is still 
in good agreement with the psychophysical data. 
Psychophysical thresholds for face recognition are not 
significantly affected by the structure of the background 
in which a face is embedded (Q). Therefore, although 
the faces used in this study are shown against an uni- 
form background, the validity of results should extend 



to arbitrary backgrounds. Notice, however, that ampli- 
tude spectra consider the complete frequency content of 
an image, whereas humans have attentional mechanisms 
which allows them to process only a region of interest, 
and ignore background effects. Windowing the face im- 
ages with a Blackman-Harris window achieves the the 
same computational purpose: anything but the internal 
face features are suppressed. A follow-up paper examines 
in more detail the properties of internal face features by 
means of a model of simple and complex cells. 
The statistical prediction of a preferred band of spatial 
frequencies may also have implications for artificial face 
recognition systems. Future experiments should system- 
atically address the question whether the recognition per- 
formance of artificial systems is optimal at spatial fre- 
quencies similar to those used by humans. 



Methods 

Face images. We used 868 female face images, and 868 male 
face images from the Face Recognition Grand Challenge database 
(FRGC, http://www.frvt.org/FRGC or www.bee-biometrics.org). 
Original images (1704 X 2272 pixels, 24-bit true color) were ad- 
justed for horizontal alignment of eyes, before they were down- 
sampled to 256 X 256 pixels and converted into 8-bit grey-scale. 
Subsequently, the positions of left eye, right eye, and mouth 
[(xie,yie), {xre,yre), and [x mouth , V mouth) , respectively] were man- 
ually marked by two persons (M.S.K. and E.G.) with an ad hoc 
programmed graphical interface. The position of each face center 
(^ nose) was approximated as Xnose = rnd((2:;e-|-a:re)/4+a:„o„(h/2) 
and y„ose = rnd[0.95 * rnd({/fc -|- {y^outh - {Vle + !/re)/2)/2), where 
rnd( ) denotes rounding to the nearest integer value. 

Dimension of spatial frequency. For conversion of spatial 
frequency units, face dimensions were manually marked with an 
ad hoc programmed graphical interface. The factors for multi- 
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plying "cycles per image" to obtain "cycles per face width" were 
0.41 ± 0.013 (females, n = 868) and 0.43 ± 0.012 (males, n = 868). 
Corresponding factors for obtaining "cycles per face height" were 
0.46±0.021 (females) and 0.47 ± 0.018 (males). Conversion factors 
at oblique orientations were calculated under the assumption that 
horizontal and vortical conversion factors define the two main axis 
of an ellipse. Pooling of results over gender implied also a corre- 
sponding averaging of conversion factors, and the factors for width 
and height were averaged in the isotropic case. 

Amplitude spectra. Let the features which are not part of the 
actual face be denoted by external features (e.g., shoulder region 
or hair). On the other hand, internal features refer to the eyes, the 
mouth, and the nose. The presence of external features in our face 
images influences in their amplitude spectra, and may cause trun- 
cation artifacts. It is thus desirable to compare results with and 
without the presence of external features. A good suppression of 
external features could be achieved by centering a minimum 4--term 
Blackman- Harris window ([3) at {xnose,ynose) (Supp. Fig. [7]& 
|8}. Nevertheless, application of the window leaves a characteris- 
tic "fingerprint" in each spectrum (Supp. Fig. [5ji). This artifi- 
cial "fingerprint", as well as the spurious lines caused by trunca- 
tion, could be attenuated with a correction procedure based on a 
spatially varying diffusion mechanism (outlined below). Thus, for 
each face image, originally four types of amplitude spectra were 
considered: the original "raw" spectrum, the "Blackman- Harris" - 
spectrum, and their respective corrected versions (i.e., "corr. raw" 
and "corr. B.H."). 

Correction of Amplitude Spectra. Let P e {0,1}"^" be a 
binary n X n matrix of the same size as the 2-D amplitude spectra 
A. In P, artifacts are represented by ones, while all other posi- 
tions are set to zero. Thus, P is set to the image shown in Supp. 
Fig. [5j&) for correcting the Blackman-Harris spectrum, and Supp. 
Fig. Efc) for the raw spectrum. The idea of the correction algo- 
rithm consists in simply averaging out the positions with artifacts. 
To this end, information from neighboring positions flows into arti- 
fact positions. This process is called inward diffusion. Let X{t) be 
a sequence of corrected amplitude spectra parameterized over time 
t, with the initial condition X{0) = A.. Inward diffusion is defined 
by dXij/dt = PijV^A^j, where {i,j) denotes matrix positions. The 
diffusion process was terminated at the moment when the correla- 
tion difference c{t) — c(t + At) was smaller than 0.001, or when a 
maximum of 100 iterations were done. 

Slopes of amplitude spectra. Isotropic slopes a: Amplitudes 
associated with a given spatial frequency lie on a circle. This is to 
say that when representing the spectrum with polar coordinates, 
then spatial frequencies vary along the radial coordinate, but stay 
constant while varying orientation. An isotropic amplitude spectra 
is obtained by averaging all amplitudes with a fixed spatial fre- 
quency across orientations (i.e., for each circle, the mean value of 
all amplitudes of the circle was computed). Because the logarith- 
mized amplitude spectra of face images fall approximately linear as 
a function of log-frequency, a line with slope a could be fitted to the 



isotropic spectra. Although in principle amplitude data were avail- 
able from A; = 1 to A; = 127 cycles per image, only the interval from 
kmin = 8 to kmax = 100 was used for line fitting. I used the func- 
tion "robustf it" (linear regression with low sensitivity to outliers) 
provided with Matlab's statistical toolbox (Matlab version 7.1.0.183 
R14 SP3, Statistical Toolbox version 5.1, see www.mathworks.com). 
Oriented spectral slopes «(©) (Figure[2]l: Each 2-D amplitude spec- 
trum was subdivided into 12 "pie slices" (each with A© = 30°). 
For each pie slice with orientation 0, an (oriented) isotropic 1-D 
spectrum was analogously computed as just described (with am- 
plitudes being averaged across arcs), and subsequently a line with 
slope o(©) was fitted. 

Slope- Whitening of Amplitude Spectra. This algorithm pro- 
ceeds in straight analogy to whitening of the isotropic spectra. 
Let a be the isotropic slope value corresponding to a 2-D ampli- 
tude spectrum A{kj;,ky) with spatial frequency coordinates kx, 
ky S [1,127] cycles per image. Let k = ^ k'^ + k"^ (radial spatial 
frequency). Then, the corresponding whitened spectrum W is de- 
fined as Wikxyky) = A{kx,ky) ■ fel^l. Qualitatively, the W were 
not different from a more advanced procedure that consisted in 
subdividing A into oriented "pie slices" and whitening each with 
its corresponding oriented slope value a(0). Therefore, only those 
results are presented where A was whitened with an isotropic slope 
value (the term "isotropic" in the headline of the spectra in Fig- 
ure HI and Supp. Fig. [TD] indicates just this). 

Whitening by Variance. Amplitudes in the spectrum A{kx, ky) 
with equal spatial frequencies lie on a circle with radius k = 
y'fcl"-!-'^^. Let nfe be the number of points on this circle (n^, mono- 
tonically increases as a function of A;). Let A{k, 0) be the spectrum 
in polar coordinates. Then, we first average, for each k, all am- 
plitudes across orientations according to tJ.{k) = X]e "^C^' ■ 
The variance is subsequently computed as cr'^{k) = ®) ~ 

/x)^/(nfc — 1). Finally, the variance- whitened spectrum is defined as 
V = A/ {cr'^ (fc) + e) with a small positive constant e ^ 1. Examples 
of V are shown in Supp. Fig. 1111 

Whitening by Diffusion. Let X{kx,ky,t) a sequence of ampli- 
tude spectra parameterized over time t, with the initial condition 
X{kx,ky,0) = A{kx, ky). For t > 0, the X are defined according 
to the diffusion equation dX/dt = V^A". The whitened spectrum 
then is V = A/(l + X{t max)) at precisely the instant tmax when 
the Shannon entropy of T> is maximal. 
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Supplementary Figures 



FIG. 5: Artifacts in the amplitude spectra, (a) The log-amplitude-spectrum of the minimum 4-term Blackman-Harris 
(B.H.) window reveals a characteristic "fingerprint" (shown in this image), which also emerges when averaging a big number of 
amplitude spectra of B.H. -windowed faces, (b) The "fingerprint" is transformed into a binary image by thresholding with —0.25 
(black color indicates values with 1, and white indicates 0). (c) Manually marked line artifacts which appear by averaging the 
amplitude spectra of a big number of face images (here without windowing). 




FIG. 6: Suppressing artifacts in the amplitude spectra. This figure illustrates how artifacts in the amplitude spectra are 
suppressed by a nonlinear diffusion process, where the thresholded images of Supp. Fig. [5] served as spatially variant diff'usion 
coefficient (see methods section), (a) Original face image, (b) The log-amplitude-spectrum of the image has horizontal and 
vertical lines which are generated as a consequence of truncating the shoulder region (c.f. Supp. Fig. [5ls). (c) The spectrum 
after one iteration of nonlinear diffusion, with a difference in correlation to the original spectrum Ac(l) = c(0) — c(l) = 0.24052. 
The spurious lines are already attenuated, (d) Three iterations with Ac(3) = 0.05905. (e) 12 iterations with Ac(12) < 0.001, 
which is the stopping criterion. The artificial lines are largely suppressed. The rest of the amplitude spectrum remains intact, 
and more interesting structures are now visible. 
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FIG. 7: Suppression of external face features. The images in tlie bottom row (e-/i) siiow tlie logaritlimized amplitude 
spectra of the images {a~d) (face ID 104). The amplitude spectrum (e) of the original image (a) shows spurious horizontal 
and the vertical lines. (6) The spurious vertical line disappeared in the amplitude spectrum (/) when the shoulder region was 
manually erased, and the horizontal line then had a smaller amplitude, (c) Erasing all external face features led to the creation 
of a "moonface" , thereby suppressing all of the artificial lines (g) . Finally, in (ci) , a minimum 4-term Blackman- Harris window 
was centered at the nose position of the original image. The corresponding amplitude spectrum [h) of the windowed image is 
very similar to the amplitude spectrum of the "moonface" spectrum (but see Supp. Fig. [Si . 
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FIG. 8: Similarities between the logarithmized amplitude spectra of "moonfaces" and windowed faces. For six se- 
lected female images which revealed strong line artifacts in their amplitude spectra, I computed similarity measures between the 
logarithmized amplitude spectra of corresponding "moonfaces" (e.g., Supp. Fig. [Tj;) and windowed faces (e.g., Supp. Fig. [Tfi; 
the window type is specified by the numbers at the abscissae) . The plot shows mutual information averaged across the the six 
images (mean ± s.d.). The center of the window was either positioned always at the center position of each image ("rigid"), 
or at the nose position with variable radius ("adaptive") - see legend. The minimum 4-term Blackman- Harris window scored 
the highest similarity (indicated by a red star). With correlation instead of mutual information, the curves show nearly the 
same relative similarities. In that case, the maximum average correlation value (± s.d.) was 0.87 ± 0.02 again for the adaptive 
minimum 4-term Blackman- Harris window. 

The identification numbers ("Fourier-IDs") of the windows were 1—Chebyshev window, 2=Nuttall-defined minimum 4-term 
Blackman-Harris window, 3=Bohman window, A—Parzen (de la Valle-Poussin) window, 5=minimum ^-term Blackman- 
Harris window, 6— Blackman window, 7=modified Bartlett-Hann window, 8—Hann (Hanning) window, 9=triangular window, 
lO=Bartlett window, 11— Gaussian window, 12=flat top weighted window, 13— Hamming window, 14=Tukey (tapered cosine) 
window, 15— Kaiser window, 16=sharp-edged disk. 
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e=-l.S4.9± 0.0738, P=0.001 [8S8 samples] 



e=-l.B4S± 0.0757, P=0.001 [8B8 samples] 
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FIG. 9: Slopes for individual images (male faces). The histograms show the probabihty of occurrence of slope values 
e = a across all 868 male face images. For each face image, a corresponding slope value was obtained from fitting a line to the 
double-logarithmic representation of its isotropic 1-D amplitude spectrum (frequency range for fitting from 8 to 100 cycles per 
image). The centered vertical line in each histogram is the average a, and the fianking lines denote ±1 s.d., respectively. A 
Jarque-Bera test was used to test the slope values for normal distribution (this test could be applied because of our large sample 
size) - corresponding P- values are indicated with each histogram. Corresponding histograms for female images are similar, (a) 
Raw spectrum: a = -1.649 ± 0.0738, P < 0.001. (b) Corrected raw: a = -1.645 ± 0.0757, P < 0.001. (c) Blackman-Harris: 
a = -1.673 ± 0.0785, P = 0.03. (d) Corrected Blackman-Harris; a = -1.642 ± 0.0895, P < 0.001. 
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FIG. 10: Whitening by slope. Analogous to figure [4] but for face images of males. 
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FIG. 11; Whitening by variance. Analogous to figure |4l^a) (females - left panel) and figure fTOT al (males - right panel) 
but here for variance-whitening. Again, as witli the slope-whitenend spectra, maxima are revealed at low spatial frequencies 
for horizontally oriented features (as indicated by the white regions close the center). 



